Confirm Delete?
Are you sure you want to remove from the report?

Data Preview

Tail

num_bedrooms_per_room num_total_rooms num_median_income num_total_bedrooms num_longitude num_latitude num_housing_median_age num_population num_households median_house_value
16507 -0.883770 -0.223214 1.175476 -0.493429 0.732039 -0.676753 1.467256 -0.531045 -0.396344 300900.000000
16508 0.830777 -0.241719 -0.760754 0.101226 0.802025 -0.864110 -0.205853 0.155909 0.161708 236100.000000
16509 -1.188592 1.812299 1.584881 0.875728 -1.202583 1.014146 -0.922899 0.795677 0.910005 312500.000000
16510 -0.337287 0.403631 -0.231791 0.312981 -1.047614 0.503598 0.351850 0.684843 0.491466 158100.000000
16511 2.923794 -0.986532 -1.291824 -0.577551 0.652054 -0.751696 0.989225 0.046172 -0.548540 109600.000000

Pre Processing

Pre Processing - Imputations

Pre Processing - Imputations - Missing

No missing values in data

Pre Processing - Imputations - Infinitys

No Infinity values

Health Analysis

Health Plot

Missing Plot

Missing Value Summary

No Missing Values

Duplicate Columns

No duplicate variables

Outliers In Features

Data Shape:(16512, 10)
feature < (mean-3*std) > (mean+3*std) < (1stQ - 1.5 * IQR) > (3rdQ + 1.5 * IQR) -inf +inf
num_bedrooms_per_room 0 230 0 505 0 0
num_total_rooms 0 449 0 1037 0 0
num_median_income 0 316 0 533 0 0
num_total_bedrooms 0 448 0 1022 0 0
num_population 0 410 0 957 0 0
num_households 0 452 0 973 0 0
median_house_value 0 0 0 852 0 0

Feature Analysis

Summary Stats

Summary Stats - Numeric Variables

Variable Name Datatype No of Unique Samples Mean Standard Deviation Min 25th percentile Median 75th percentile Max
0 median_house_value float64 3671 [277600.0000000102, 216700.00000001022, 123200.00000001022, 249300.00000001022, 222000.00000001022] 206766.125485 115146.155150 14999.000000 119800.000000 180150.000000 264700.000000 500001.000000
1 num_bedrooms_per_room float64 15120 [-1.5146024036606813, 3.0000908471496897, -0.5707410441521086, 0.14529453001578047, 0.01125721386038621] -0.019360 0.900670 -1.970307 -0.639926 -0.183124 0.449931 3.000091
2 num_households float64 1336 [-0.5612234462159783, -0.39951516499498935, -1.0368360380424164, -0.3519539058123455, 0.13317093785062142] -0.013868 0.953139 -1.534644 -0.653175 -0.244148 0.367806 3.000091
3 num_housing_median_age float64 52 [-0.28552448669182645, 0.5908656031570931, -1.3212582292405497, 1.4672556930060128, -1.2415864028906478] -0.000000 1.000030 -2.197648 -0.843227 0.033163 0.670537 1.865615
4 num_latitude float64 842 [-1.3278190300432111, -0.7189081084316511, -0.83600636258772, 0.4661262236277625, 0.9204674497533124] 0.000000 1.000030 -1.444917 -0.793851 -0.643965 0.976675 2.962661
5 num_longitude float64 826 [1.246938194641592, 0.6270592693926523, 1.151956746417959, -1.1775882468562637, -1.2525736007170176] 0.000000 1.000030 -2.392351 -1.108852 0.532078 0.777030 2.626669
6 num_median_income float64 10649 [2.2797464694834613, -1.4734969697718119, -0.6297516301418874, -0.45058384445933847, 0.5668637239972327] -0.004475 0.985947 -1.895540 -0.726310 -0.169421 0.518798 3.000091
7 num_population float64 3230 [-0.5211690516569171, 0.13176720594646754, -0.9469493339260149, -0.4805662927807402, 0.3227099098506506] -0.015087 0.948073 -1.517583 -0.655048 -0.243534 0.365507 3.000091
8 num_total_bedrooms float64 1446 [-0.6036581650773796, -0.2961778882749543, -0.9082376845514801, -0.35129227751312486, 0.08382132173559016] -0.015059 0.948615 -1.508694 -0.650070 -0.258468 0.347790 3.000091
9 num_total_rooms float64 5031 [-0.03758930820388485, -1.0218061523074191, -0.8066894272742141, -0.4296568876998873, 0.013876789989543201] -0.017484 0.939109 -1.475749 -0.640726 -0.248080 0.336552 3.000091

Summary Stats - Non Numeric Variables

No categorical columns

Distributions

Distributions - Numeric Variables

Distributions - Numeric Variables - Median House Value

Distributions - Numeric Variables - Num Bedrooms Per Room

Distributions - Numeric Variables - Num Households

Distributions - Numeric Variables - Num Housing Median Age

Distributions - Numeric Variables - Num Latitude

Distributions - Numeric Variables - Num Longitude

Distributions - Numeric Variables - Num Median Income

Distributions - Numeric Variables - Num Population

Distributions - Numeric Variables - Num Total Bedrooms

Distributions - Numeric Variables - Num Total Rooms

Distributions - Non Numeric Variables

No categorical variables in data.

Feature Normality

Feature Interactions

Correlation Table

Variable 1 Variable 2 Corr Coef Abs Corr Coef
0 num_households num_total_bedrooms 0.969908 0.969908
1 num_latitude num_longitude -0.924018 0.924018
2 num_total_bedrooms num_total_rooms 0.916246 0.916246
3 num_households num_total_rooms 0.911062 0.911062
4 num_households num_population 0.906269 0.906269
5 num_population num_total_bedrooms 0.867682 0.867682
6 num_population num_total_rooms 0.835993 0.835993
7 median_house_value num_median_income 0.694318 0.694318
8 num_bedrooms_per_room num_median_income -0.676478 0.676478
9 num_housing_median_age num_total_rooms -0.386752 0.386752
10 num_housing_median_age num_total_bedrooms -0.334721 0.334721
11 num_households num_housing_median_age -0.315256 0.315256
12 num_housing_median_age num_population -0.312752 0.312752
13 median_house_value num_bedrooms_per_room -0.279251 0.279251
14 num_median_income num_total_rooms 0.240251 0.240251
15 num_bedrooms_per_room num_total_rooms -0.220888 0.220888
16 median_house_value num_total_rooms 0.158906 0.158906
17 median_house_value num_latitude -0.147527 0.147527
18 num_bedrooms_per_room num_housing_median_age 0.135581 0.135581
19 num_housing_median_age num_median_income -0.134962 0.134962
20 num_latitude num_population -0.121432 0.121432
21 num_bedrooms_per_room num_latitude -0.121279 0.121279
22 num_bedrooms_per_room num_total_bedrooms 0.114063 0.114063
23 num_longitude num_population 0.108916 0.108916
24 num_housing_median_age num_longitude -0.105459 0.105459
25 median_house_value num_housing_median_age 0.102605 0.102605
26 num_bedrooms_per_room num_longitude 0.099867 0.099867
27 num_bedrooms_per_room num_households 0.086910 0.086910
28 num_latitude num_median_income -0.080351 0.080351
29 num_households num_latitude -0.073925 0.073925
30 median_house_value num_households 0.073732 0.073732
31 num_longitude num_total_bedrooms 0.066143 0.066143
32 num_latitude num_total_bedrooms -0.064710 0.064710
33 num_bedrooms_per_room num_population 0.064110 0.064110
34 median_house_value num_total_bedrooms 0.055304 0.055304
35 num_households num_longitude 0.054335 0.054335
36 median_house_value num_longitude -0.042476 0.042476
37 num_longitude num_total_rooms 0.035345 0.035345
38 median_house_value num_population -0.033197 0.033197
39 num_latitude num_total_rooms -0.027724 0.027724
40 num_longitude num_median_income -0.019119 0.019119
41 num_households num_median_income 0.016288 0.016288
42 num_median_income num_total_bedrooms -0.011732 0.011732
43 num_housing_median_age num_latitude 0.007611 0.007611
44 num_median_income num_population 0.000711 0.000711

Correlation Heatmap

Covariance Heatmap

Bivariate Plots (top 50 Correlations)

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Total Bedrooms

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Population

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Population

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Households Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Households Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Population Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Bedrooms Per Room Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Latitude

Key Drivers

Median House Value

Median House Value - Feature Scores - Feature Correlation

Median House Value - Feature Importances - From Model

Median House Value - Pca Analysis

Median House Value - Pca Analysis - Pca Projection

Median House Value - Pca Analysis - Correlation With Dimension 2 (y)

Median House Value - Pca Analysis - Correlation With Dimension 1 (x)

Median House Value - Bivariate Plots

Median House Value - Bivariate Plots - Num Bedrooms Per Room

Median House Value - Bivariate Plots - Num Total Rooms

Median House Value - Bivariate Plots - Num Median Income

Median House Value - Bivariate Plots - Num Total Bedrooms

Median House Value - Bivariate Plots - Num Longitude

Median House Value - Bivariate Plots - Num Latitude

Median House Value - Bivariate Plots - Num Housing Median Age

Median House Value - Bivariate Plots - Num Population

Median House Value - Bivariate Plots - Num Households